Object Priors for Classifying and Localizing Unseen Actions

نویسندگان

چکیده

Abstract This work strives for the classification and localization of human actions in videos, without need any labeled video training examples. Where existing relies on transferring global attribute or object information from seen to unseen action we seek classify spatio-temporally localize videos image-based only. We propose three spatial priors, which encode local person detectors along with their relations. On top introduce semantic extend matching through word embeddings simple functions that tackle ambiguity, discrimination, naming. A embedding combines priors. It enables us a new retrieval task retrieves tubes collections based user-specified objects, relations, size. Experimental evaluation five datasets shows importance priors actions. find persons objects have preferred relations benefit localization, while using multiple languages filtering directly improves matching, leading state-of-the-art results both localization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classifying Unseen Cases with Many

Handling missing attribute values is an important issue for classiier learning, since missing attribute values in either training data or test (unseen) data aaect the prediction accuracy of learned classi-ers. In many real KDD applications, attributes with missing values are very common. This paper studies the robustness of four recently developed committee learning techniques, including Boosti...

متن کامل

Classifying Unseen Cases with Many Missing Values

Handling missing attribute values is an important issue for classiier learning, since missing attribute values in either training data or test (unseen) data aaect the prediction accuracy of learned classiiers. In many real KDD applications, attributes with missing values are very common. This paper studies the robust-ness of four recently developed committee learning techniques, including Boost...

متن کامل

Classifying and Coding Online Actions

Research on how the Internet is diffusing across the population has broadened from questions about who uses the medium to what people do during their time online. With this change in focus comes a need for more detailed data on people’s online actions. The author provides a method for coding and classifying users’online information-seeking behavior. The author presents an exhaustive list of way...

متن کامل

Classifying Facial Actions

The Facial Action Coding System (FACS) [23] is an objective method for quantifying facial movement in terms of component actions. This system is widely used in behavioral investigations of emotion, cognitive processes, and social interaction. The coding is presently performed by highly trained human experts. This paper explores and compares techniques for automatically recognizing facial action...

متن کامل

Comparing pixel-based and object-based algorithms for classifying land use of arid basins (Case study: Mokhtaran Basin, Iran)

In this research, two techniques of pixel-based and object-based image analysis were investigated and compared for providing land use map in arid basin of Mokhtaran, Birjand. Using Landsat satellite imagery in 2015, the classification of land use was performed with three object-based algorithms of supervised fuzzy-maximum likelihood, maximum likelihood, and K-nearest neighbor. Nine combinations...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computer Vision

سال: 2021

ISSN: ['0920-5691', '1573-1405']

DOI: https://doi.org/10.1007/s11263-021-01454-y